INTEGRAL CRITERIA FOR TRANSPORTATION-COST INEQUALITIES 
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Abstract. In this paper, we provide a characterization of a large class of transportation-cost in- 
equalities in terms of exponential integrability of the cost function under the reference probability 
measure. Our results completely extend the previous works by Djellout, Guilin and Wu [8] and 
Bolley and Villani 



1. Introduction 

In all the paper, (X, d) will be a polish space equipped with its Borel er-field. The set of probability 
measures on X will be denoted by V{X). 

1.1. Norm-entropy inequalities and transportation cost inequalities. The aim of this paper 
is to give necessary and sufficient conditions for inequalities of the following form : 

VveV(X), a(]\v-fj,\\l)<U(y\fj,), (1.1) 

where 

• a : R + — > M + U {+00} is a convex lower semi-continuous (l.s.c) function vanishing at 0, 

• The semi- norm \\v — fi\\^ is defined by 

\\v - fj,\\% := sup I / ipdv- I (pdfi\, (1.2) 
ve* VJx Jx J 

where $ is a set of bounded measurable functions on X which is symmetric, i.e. 

ip e $ => -(p e 

• The quantity H(^ | fi) is the relative entropy of v with respect to /1 defined by 

f djy 
H(z/ I n) = / log — dv, 

if v is absolutely continuous with respect to /i and +00 otherwise. 

Inequalities of the form (|l.lf> were introduced by C. Leonard and the author in They are called 
norm-entropy inequalities. An important particular case, is when <i> is the set of all bounded 1-Lipschitz 
functions on X : $ = BLip 1 (A', d). Indeed, in that case ||f — is the optimal transportation cost 
between v and /i associated to the metric cost function d(x, y). Let us recall that if c : X x X — > IR + 
is a lower semi-continuous function, then the optimal transportation cost between v G V(X) and 
/i G 'P(X) is defined by 

T c (v,n)=w£ / c(x,y)d7r(x,y) (1.3) 
Jx 2 
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where ir describes the set ji) of all probability measures on X x X having v for first marginal 
and n for second marginal. According to Kantorovich-Rubinstein duality theorem (see e.g Theorem 
1.3 of HB1), if the cost function c is the metric d, the following identity holds 



T d (u,fj,)= sup < / cpdv- / tpdfiV. (1.4) 
tpeBUp 1 (X,d) Ux Jx J 

In this setting, inequality l|l.l(l becomes 

Vf6P(4 a{T d {v,n))<H{v\ i i) (1.5) 

Such an inequality is called a convex transportation- cost inequality (convex T.C.I). 



1.2. Applications of transportation-cost inequalities. After the seminal works of K. Marton 
|14l ITS'] and M. Talagrand new efforts have been made in order to understand this kind of 
inequalities. The reason of this interest is the link between T.C.I and concentration of measure 
inequalities. Namely, according to a general argument du to K. Marton, if fi satisfies i[1.5|l . then /j, 
has the following concentration property 

\/A C X s.t. n(A) > -, Ve > r, fi{A e ) > 1 - e~ a{£ ~ r \ 

with r = a _1 (log(2)) and A e = {x £ X : d(x,A) < e). For a proof of this fact, see e.g. Theorem 9 
of Other applications of T.C.Is were investigated in [S], and ^3- In these papers, it was 
shown that T.C.Is are an efficient way for deriving precise deviations results for Markov chains and 
empirical processes. One can also consult |5J and for applications of norm-entropy inequalities 
to the study of conditional principles of Gibbs type for empirical measures and random weighted 
measures. 



1.3. Necessary and sufficient conditions for norm-entropy inequalities. Our main result gives 
necessary and sufficient conditions on fj, for (|l.lfl to be satisfied. Before to state it, let us introduce 
some notations. In all what follows, C will denote the set of convex functions a : M + — ► M + U {+00} 
which are lower semi continuous (l.s.c) and such that a(0) = 0. For a given a, the monotone convex 
conjugate of a will be denoted by a®. It is defined by 

Vs>0, a®(s) = sup {st- a(t)}. 

t>Q 

Note that, if a belongs to C, then a® also belongs to C. Furthermore, one has the relation a® ® — a. 
If a is in C, the Orlicz space L rQ (X, fi) associated to the function r Q :— e a — 1 is defined by 

L Ta (Af, fi) = j/ : X -> K such that 3A > 0, J r a (^j dfi < +00 j , 

where /x almost everywhere equal functions are identified. The space L Ta (X, fj,) is equipped with its 
classical Luxemburg norm || . || Tq , i.e 

V/ £ hr a (X,n), \\f\\ Ta =fnf|A> such that J r a (£j dfj. < lj . 

We will need the following assumptions on a : 
Assumptions. 

(A\) : The effective domain of a® is open on the left, i.e {s £ M + : a®(s) < +00} = [0,b[, for some 
b>0. 

(A 2 ) : The function a® is super- quadratic near 0, i.e 

3s a ® > 0,c Q © > 0, Vse[0,s Q ©], a®(s)>c a »s 2 . (1.6) 
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We can now state the main result of this paper, which will be proved in section 2. 

Theorem 1.7. Let a € C satisfy assumptions (A\) and (A2) and n £ V(X). The following statements 
are equivalent : 

(1) 3a > such that , ~iv G V{X), a ( I^lMi ) < H ( I/ | M ) 



a 

(2) 3M > swc/i that , VV> € ||<p - *(y>, /z) || Tcl < M. 

More precisely, if (1) holds true then one can take M = 3a. Conversely, if (2) holds true, then one 
can take a = \[2vn a M , with m a defined by 

m„ = emax } — I with u£ [0, If sttc/i that : 

it /-sr U 

, < s a ® V Cq and < 2, 

yl — u 1 — u 

where the constants s Q ® and c Q ® are given by \l.b)) . 
Remark 1.8. 

• If $ contains an element which is not [i-a.c constant, and if inequality holds for some 
a G C, then a satisfies assumption A2 (see Lemma \2.1)) . 

• The constant a = \f2m a M is not optimal. This can be easily checked by considering the 
celebrated Pinsker inequality, i.e 

Vi/e fafe <HHM), (1.9) 



where \\v — /J-Wtv is the total-variation norm which is defined by 
\W - m||tv = supjy tpdv -J ipdfi, \<p\ < 1 j 



In order to prove Theorem 11.71 we will take advantage of the dual formulation of norm-entropy 
inequalities developed in [2]. Namely, according to Theorem 3.15 of [21, we have the following result 

Theorem 1.10. The inequality 

VveV(X), <hHm), 

with a € C is equivalent to the following condition : 

V</?e$, V.seR+, j e sv d^<e s{v '^ )+a%{as) . (1.11) 
Jx 

According to (|1.11|) . the only thing to know is how to majorize the Laplace transform of a centered 
random variable X knowing that this random variable satisfies an Orlicz integrability condition of the 

form : E 



< +00, for some A > 0. Estimates of this kind are very useful in probability theory, 

because they enable us to control the deviation probabilities of sums of independent and identically 
distributed random variables. In we have shown how to deduce Pinsker inequality from the 
classical Hoeffding estimate (see Section 2.3 of 52)- We also proved that the weighted version of 
Pinsker inequality i|1.20fl recently obtained by Bolley and Villani in 3 is a consequence of Bernstein 
estimate (see Corollaries 3.23 and 3.24 of ^2])- Here, Theorem II . 71 will follow very easily from the 
following theorem which is du to Kozachenko and Ostrovskii (see and 0] p. 63-68) : 
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Theorem 1.12. Suppose that a S C satisfies Assumptions (Ai) and (A2), then for all f £ l^ Ta {X , n) 
such that f x fd[A = 0, the following holds 

Vs>0, / e s/ d/i< e a@{as \ 
Jx 

with a = \/2m a \\f\\ Ta , where m a is the constant defined in Theorem \1.7\ 

For further informations on the preceding result, we refer to Chapter VII of 11 (p. 193-197) where 
a complete detailed proof is given. Before proving Theorem 11.71 we discuss below some of its appli- 
cations. 

1.4. Applications to T.C.Is. Applying the preceding theorem to the case where <& is the Lipschitz 
ball BLip 1 (A', d), one obtains the following result. 

Theorem 1.13. Let a € C satisfy assumptions (A\) cmG?(A 2 ) and fj, S V (X) be such that J x d(xo, x) d/i(x) < 
+00 for all xq £ X. The following statements are equivalent : 

(1) 3a > such that £ V(X), a f Td ^ ^ < R(i> | //). 

(2) For all Xq € X , the function d(xo, . ) £ h Ta {X , ^1). 

More precisely, if (2) holds true, then one can take a = 2\f2m a inf KoS ^ ||rf(^0j OIItq; where m a was 
defined in Theorem \l//\ 

Actually, other transportation cost inequalities can be deduced from Theorem 1 1.71 Using a majoriza- 
tion technique developed by F. Bolley and C. Villani in [3], we will prove the following result : 

Theorem 1.14. Let c( . , . ) be a cost function such that c(x,y) — q(d(x,y)), where q : M + — > R + is 
an increasing convex function satisfying the A2- condition, i.e 

3K > 0, Vie R+, q(2x) < Kq{x). (1.15) 

If a £ C satisfies assumptions (Ai) and (A2), then for all /i £ V(X) such that J x c(xq, x) dpt,(x) < +00 
for all xq £ X, the following statements are equivalent : 

(1) 3a > 0, W £ V(X), a C^^P ) < H(v | n), 

(2) For all xq £ X , the function c(xo, ■ ) £ L Tq (X , fj,). 

More precisely, if (2) holds true then one can take a — \[2Km a u\i Xo ^x ||c(a;o, . )||r Q ■ Furthermore, if 
dom a = K + then the following inequality holds 

1 / l0£ f e Sa ( c ( x 0,x)) Anlx) \ 

£ V(X), %(y, fi) < V2Km a inf - 1 + Jx = — o~ x (H(i/ | ft)) (1.16) 

x ex, 8>o \ log 2 / 

Contrary to what happens in the case where c is the metric d, a transportation-cost inequality 
a (T c (i>, [i)) < H(i> I fi) can hold even if a does not satisfy Assumption {A2). The most known 
example is Talagrand inequality, also called T2-inequality. Let us recall that a probability measure fj, 
on R" satisfies the Talagrand inequality T2(a) if 

Vi/ G V(X), T d 2 („, (j,) < a R(v I M ), (1.17) 

where d(x, y) — y/Yl7=i( x i ~~ Vi) 2 - Gaussian measures do satisfy a T 2 -inequality. This was first shown 
by Talagrand in J7j. In this case, the corresponding a is a linear function and hence its monotone 
conjugate a® does not satisfy (A2). Sufficient conditions are known for Talagrand inequality. In 
|16| . it was shown by F. Otto and C. Villani that if d[i = e~*G?x is a probability measure on R n 
satisfying a logarithmic Sobolev inequality with constant a, then it also satisfies the inequality ¥2(0). 
Furthermore, if fi satisfies ¥2(0), then it satisfies the Poincare inequality with a constant a/2. An 
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alternative proof of these facts was proposed in pp by S.G. Bobkov, I. Gentil and M. Ledoux. In 
a recent paper P. Cattiaux and A. Guillin gave an example of a probability measure satisfying T2 
but not the logarithmic Sobolev inequality (see [S])- A necessary and sufficient condition for T2 is 
not yet known. Other examples of transportation-cost inequalities involving a linear a can be found 
m HI- HI an d El- The common feature of these T2-like inequalities is that they enjoy a dimension 
free tensorization property (see e.g Theorem 4.12 of |12p which in turn implies a dimension free 
concentration phenomenon. 

1.5. About the literature. Theorems ll 1 14l and ll . 13l extend previous results obtained by H. Djellout, 
A. Guillin and L. Wu in [5] and by F. Bolley and C. Villani in [3J. 

In [5], H. Djellout, A. Guillin and L. Wu obtained the first integral criteria for the so called Ti- 
inequality. Let us recall that a probability measure \x on X is said to satisfy the inequality f\{a) if 



VveP{X), T~ d (v, ^i) 2 < aU(v I /i). (1.18) 

According to Jensen inequality, T d (v, fi) 2 < T d 2(i/,fj,), and thus T2(a) =>• Ti(a). The inequality Ti 
is weaker than T2 and it is also considerably easier to study. According to Theorem 3.1 of |5], the 
following propositions are equivalent : 



(1) 3a > 0, such that /j, satisfies Ti(a) 

(2) 35 > such that f e Sd{x > y)2 d^{x)d^{y) < +c 
Jx 2 



More precisely, if 



IX 2 

a $d(x,y) 



X 2 



dfi(x)dfi(y) < +00 for some 5 > 0, then one can take 



a = — sup 
k>i 



(A! 



2\ i/fc r 



(2fc!) 



fd{x,yf dfM( X )dfJ,(y) 



X 2 



l/k 



< +00. 



(1.19) 



The link between the constants a and 6 was then improved by F. Bolley and C. Villani in [3] (see 
(jl~21) bellow). 

In [3], F. Bolley and C. Villani obtained the following weighted versions of Pinsker inequality : if 
X '■ X — > M + , is a measurable function, then for all v E V(X), 



\\X- {u-n)\\ T v < ( 2 



X 



e 2 ^d^ (y/K(y\iM) + ±E.(v\n) 



\\ X ■ {v - h)\\tv < ^l + log^eX 2 ^v/2H(^ | fi) 



Using the following upper bound (see JS], prop. 7.10) 

T dP (is,v)<2P- 1 \\d(x , .) p -(v-h)\\tv, 



(1.20) 
(1.21) 

(1.22) 



they deduce from i|1.20fl and (|1.21|) the following transportation cost inequalities involving cost func- 
tions of the form c(x, y) = d(x, y) p with p > 1 : W S V{X\ 



T dP {iy lf i)^P <2 inf 

x GX,S>0 



T d p(p,n) < 2 inf 

x GX,S>0 



1 , 

n2 +log 



o Sd(x ,x) p 



X 



25 



1 + log 



dfi(x) 



Up 



YL(v I m) 1 ^ 



R(v I /x) 



1 V2p 



H(i/ I /x) 



l/2p 



(1.23) 
(1.24) 
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Note that for p = 1, the constant in (|1.24[) is sharper than i|1.19fl . Note also that, up to numerical 
factors, l|1.23|) and (|1.24|) are particular cases of l|1.16|l . 

In order to derive T.C.Is from norm-entropy inequalities, we will follow the lines of [3]. To do this, 
we will deduce from Theorem II .71 a general version of weighted Pinsker inequality (see Theorem 12. 711 . 
Theorem 1 1 . 1 41 will follow from Theorem 12 . 71 and from Lemma \'d . 21 which generalizes inequality 11.2211 . 



2. Necessary and sufficient conditions for norm-entropy inequalities. 
Let us begin with a remark on Assumption (A2). 

Lemma 2.1. Suppose that $ contains a function ipo which is not p,-almost everywhere constant. If 
ji satisfies the inequality 

VveV(X), a(\\u-n\\%)<K(v\ii), 
then a satisfies Assumption (^.2)- 

Proof. Let us define A ipo (s) = log f x e s(po d\x, for all s G R. According to Theorem 1 1.1 1)1 we have 

Vs>0, A V0 (a) - s(<p , 11) <a®{s). 

It is well known that 

hm 2 = - Var^(^o) > 0. 

a®(s) 

From this follows that liminf — - — > 0, which easily implies l|1.6|) . □ 

s^0+ S 2 

Remark 2.2. Note that if all the elements 0/$ are [i-almost everywhere constant, then \v — = 
for all v <^ /i. Inequality is thus satisfied, for all a G C . 

The rest of this section is devoted to the proof of Theorem 11.71 The following lemma will be useful 
in the sequel : 

Lemma 2.3. Let X be a random variable such that E [e 5 ' x '] < +00, for some 8 > 0. Let us denote by 
Ax the Log-Laplace of X , which is defined by Ax{s) = logE [e 8 ^], and by A x its Cramer transform 
defined by A* x (t) = sup s6R {st — Ax(s)}, then the following upper-bound holds : 

l + e 



Ve € [0,1[, 



< 



1 



Proof. (See also Lemma 5.1.14 of [7].) Let a < b with a G RU{— 00} and b G IRU{+oo} be the endpoints 
of dom A x . Since A^ is convex l.s.c, {A^ < t} is an interval with endpoints a < a(t) < b(t) < b, for 
all t > 0. As a consequence, 

Vt > 0, V(A X (X) >t)= F(X < a(t)) + F(X > b(t)). 

Let m = E[X]. Since A* x (m) = 0, a(t) < m. But for all u < m, it is well known that 

P(X < u) < exp(-A* x (u)) (2.4) 

If a(t) > a, the continuity of A^ on ]a,b[ easily implies that A* x (a(t)) = t. Thus, according to l|2.4|l . 

¥(X < a(t)) < e~K 

If a(t) = a, then 

(0 (a) 
P(X < a) = lim P(X < a - 1/n) < lim cxp(-A^(a - 1/n)) = lim = 0, 
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where (i) comes from ()2.4[) and (ii) from a — 1/n dom . 

Therefore, in all cases F(X < a(t)) < e _i . In the same way, we have P(A > b(t)) < e _t . As a 
consequence, 

Vi>0, F(A* X (X) > t) < 2e~ t . (2.5) 
Finally, integrating by parts and using l|2.5|) in (*) bellow, we get 

E 



e t F(k x {X)>t/e) dt= I e* dt + I e*¥(A x (X) > t/e) dt 

OO J — OO J 



< 1 + 2 [ + °° e^-Wdt-. ' + : 







1-e 

□ 



Now, let us prove Theorem II .71 

Proof of Theorem \l .7} Let us show that (1) implies (2). For <p G $, according to Theorem 1 1 . 1 Ul and 
using the fact that — ip G we have 



VseM, log / e'te-to'ri d/i < a®{\as\). (2.6) 
Jx 

Define tp :— ip — (</?, /i) and A^(s) := log J*^ e 8 ^ - ^'^ d/i. Equation l|2.6[) immediately yields 
Vt £ K, a ( — ) = sup {st - a® (|as|)} < sup {s< - A^(s)} = Ai(t). 

According to Lemma IO f x e eA ?® d/i < ±±§, for all e G [0, 1[. Thus J x e Ea ^)dfj, < i±|. Since 
a is convex and a(0) = 0, we have a ( < ea f^Y Therefore, J*^ e a (~«~^dfi < In 

other words, 

V^G$, Vee[0,l[, J r a (^jdfi<^-. 
It is now easy to see that ||v?|| T < 3a, for all ip G $. 
Now let us show that (2) implies (1). According to Theorem ll.121 

Jx 

for all £ <3>. As it is assumed that \\ip — (tp, /x)|| Tq < M, for all 93 € $, we thus have 

G $, Vs > 0, / e sip dfi < e s(e,v)+c*®(as)^ 
Jx 

with a = ^/2m a M . According to Theorem 1 1.1 01 this implies that /i satisfies the inequality 

G V(X), a < H(i/ I m)- 

□ 
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Example : Weighted Pinsker inequalities. Let \ : X — > R + be a measurable function and let 
$ x be the set of bounded measurable functions tp on X such that \<p\ < X- I n this framework, it is 
easily seen that 

\\ v - mII* x = llx ■ - m)IItv, 

where H7II7V denotes the total-variation of the signed measure 7. 

Theorem 2.7. Suppose that § x \d\i < +00 and that a £ C satisfies Assumptions (Ax) and (A2), 
then the following propositions are equivalent : 

(1) 3a > 0, such that € V{X), a \ " X ' ^ ~ ^Wtv \ < R ^ | ^ 



(2) X eL Tci (A». 

More precisely, if x € L Tcv (A', /i), t/ien one can tafce a = 2v2?7i C |.||x|| Tci . Conversely, if (1) holds true, 
then 

j 3a, if /x /ias no atoms 



" X " Tq - \ 3a + f x xdn- ||l||r Q , otherwise 
Furthermore, the Luxemburg norm ||xl|r Q can 6e estimated in the following way : 

[ 1 / log Le^^d/A 1 
' ^ d ° m Q - R+ ' thm Mr. < )nf I - s (l + S \ 2 J ] 

• //dom a = [0,r a [ or [0,r a ], then TL Ta (X,n) = L 00 (A',/i) and 

r^HxIU < IMk < sup{t > : a(t) < k^}" 1 • ||xl|oo- 

Remark 2.8. If a £ C satisfies Assumptions (Ax) and (A%) and is such that dom a — R + , we have 
thus shown the following weighted version of Pinsker inequality : 



Mv£V(X), \\ X - {y - At) || rv < 2V2m a inf U (l + ^J^^ | cT 1 (Hfr | M )) (2.9) 



Inequality completely extends Bolley and Villani's results \1.2(fy and fl.21\) . The proof of Bolley 

and Villani is very different from ours. Roughly speaking, it relies on a direct comparison of the two 
integrals f x X % - 1 d/x and f x % log ^d/i. 



Proof of Theorem \2. 7} According to Theorem 1 1.71 it suffices to show that 

oil 11 rn / \n 1 ^ / IIxIIt-c ^ M is non-atomic , . 

2||x|k> i up{||^-(^M)lk}>{ ||x|k-^x^-||l|k otherwise. ^ 

Let us prove the first inequality of l|2.10|l : If p £ <E> X , then \ip\ < X , thus \\ip — (<^,M)llr a < IIxI|t q + 
|| (tp, fi)\\ Ta ■ Thanks to Jensen inequalty, for all A > 0, we have J x r a ^ ^'jf 1 J dfx < J x r a (^) dji. Thus, 
||((^,/i)|| TQ < ||v||T a , which proves the desired inequality. 

Thanks to triangle inequality sup ¥ , e$x \\tp - ((p, fi)\\ Ta > \\x ~ (x,^)l|r« > \\x\\r a - \\ f x xdfJ>\\ Ta = 
IMk - J x xdfJ- - \\l\\r a - 

Suppose that \i has no atoms, then x - /-* nas n0 atoms too. As a consequence, there exists a measurable 
set A C X such that J AX dfj, = \ f x xd[i. Define x — X^A — X^a<=- Then |x| = x and (x>m) = 0. 
Thus sup v6$x \\<p- (<p,n)\\ Ta > \\X~ (x,M>lk = llxlk = llxllrc- 

Now, let us explain how to majorize the Luxemburg norms. Suppose that dom a = R + . If ||xlk < 

I or if / e a ( Sx ' dfi = +00, there is nothing to prove. Let us assume that ||xlk > \ and that 
Jx 
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/ e Q ^ x ' <i/i < +00. Then, denoting A = ||xIIt q , we have 
J x 

f rx\ ] 5X (u) f fX\ (Ul) f 

J expa^ — J dfi < J exp 5\a J dpi < J expa(5x)dfi 



2&x (±) 



Ix x/v/ J Jx J x 

where (i) come from the definition of A = ||xllr Q , (ii) from Jensen inequality and (iii) from the 
inequality a(x/M) < a(x)/M, for all M > 1. Taking the log in both side of the above inequality 
yields A < Sl ^„ 2 fx ex P a ($x) d^- Thus in any case, 



exp a (Sx) d[i, 



5 5\og2j x 
for all S > 0, which is the desired results. 

The case where dom a is a bounded interval is left to the reader. 



□ 



Remark 2.11. It is easy to show that when a(x) ~ x 2 , the Luxemburg norm \\x\\t 2 can be estimated 
in the following way : 



X t 2 < mi 71 H i 7. • 

" * 2 s>o5\ log 2 



M^it/i t/iis upper-bound, one obtains 



| X .(,-,)|| TV <2^infIJl + l0g ^ e52X2 ^ 



<5>0 S 



log 2 



• V2H(i/|a*), 



(2.12) 



which differs from ll.21\) only by numerical factors. The following proposition gives a way to improve 
the constants in the preceding inequality. 

Proposition 2.13. For every measurable function % : A" — > R + , the following inequality holds 



\\X-iy- fOllrv < jnf \\h + 41og / e s ' 2 x 2 dpi • ^2H(i/ | /i). 
5 >° V J* 



(2.14) 

< +00 one has the 
(2.15) 

Let X be an independent copy of X. According to Jensen inequality, we have E [e 8 !"' 1 ')] < 



x 

Proof. First let us show that if X is a real random variable such that E 
following upper bound : 

2s 7 

Vs > 0, 



~ e s(X-E[X])~ 


< e s2/2 -E 


'e x2 ' 









E 



sequently, 
E 



The random variable X — X is symmetric, thus E (X — X) 2k+1 

{X - X) 2k 



= 0, for all k. Con- 



~ e s(X-E[X])~ 


< E 











It is easily seen that 



But if s > 1, one has 



+00 s 2fe E 

E — 



eS =(X-X)V2j < E 

Vs < 1, E 



(X-AT) 2fc | +°o s 2fe E 

^E — 

k=l 



(2k) 



2 k ■ k\ 



'x 1 



, and if s < 1, 



x 



< E 



E 



,A'-' 



D s 2 (x-xy/2 



2s- 



Hence, 



,s(X-E[X]) 



< E 



2s z 



< e s2 /2 . E 


'e x2 ~ 


2 <e s2 / 2 -E 
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So, the inequality 

2s 2 



e s(X-E[X]) 



< e s 12 ■ E 



E 

holds for all s > 0. 

Let ip be a bounded measurable function such that \ip\ < Applying inequality (|2.15[) . one obtains 
immediately 

x 



with M = yl + 4 log J^, e 52 * 2 d/i. Thus, according to Theorem 1 1 . 1 Ul the following norm-entropy in- 
equality holds : 

\\X'{v- »)\\tv < ^l + 41og^_ e x 2 ^ • V2HH7I). 

Replacing % by (5% and using homogeneity one obtains (|2.14|) . □ 
Remark 2.16. Note that {2.1$ is sharper than \2.12}) . But \1. 21}) is still sharper than {2.1$ . 

3. Applications to transportation cost inequalities. 

In this section, we will see how to derive transportation-cost inequalities from norm-entropy inequal- 
ities. Let us begin with the proof of Theorem II. 131 

Proof of Theorem 1 1.1 31 First let us show that (1) implies (2). According to Theorem 11.71 one has 

su Pi^GBLip 1 (Ar d) \W~ (V) A*) Htc < 3a. In particular, using an easy approximation technique, ||d(xo, .) — 
(d(xo, .),(i)\\ Ta < 3a, and thus d(xo, •) G L, Ta (X , jj,). 

Now let us see that (2) implies (1). Let xq G X ; observe that T d (v,n) — \\v — /z||$ , with $ Xo = 
G BLip 1 (Af, d) : ip(x ) = 0}. But $ Xo C $ Xo := : Vie G A", |<p(a;)| < d(x , a;)}. Thus, T d (v, (i) < 
/"Hi = 11^(^0) ■ ) • aOIItv- Applying Theorem l2.7l one concludes that if d(xn. .) £ ~L Ta (X , fx) , 

then the inequality W G P(Af), a f T "^ ) < H(i/ | /i) holds with a = 2 v / 2m Q ||d(a;o, .)|| Ta . As this 
is true for all £0 G X, the same inequality holds for a = 2\f2m a mf xoe x ||d(xo, ■ )\\r a - ^ 

When the cost function is of the form c(x, y) — q(d(x,y)), we will use the following result which is 
adapted from Proposition 7.10 of JS] : 

Lemma 3.1. Let c be a cost function on X of the form c(x,y) = q(d(x,y)), with q : K. + — * M + an 
increasing convex function. Let xq G X and define Xx B (x) — ^q(2d(x,xo)), for all x £ X. Then the 
following inequality holds : 

VveV(X), q{T d {v^))<T c {v^)<\\ Xxa -(v-ij)\\TV. (3.2) 

Proof. For all tt G II(j/, fi), Jensen inequality yields q I / d(x,y)dn(x,y)\ < / q(d(x,y)) dn(x,y). 

\Jx 2 J Jx 2 

Thus according to the definition of T c (v, /z) (see (|1.3f) K one deduce immediately the first inequality in 

l|3.2|l . It follows from the triangle inequality and the convexity of q that 

c(x, y) = q(d(x, y)) < q(d(x, x ) + d(y, y j) < ~ [q{2d(x, x j) + q{2d(y, x ))] = Xx„ (x) + Xx (v)- 

Thus c(x,y) < d Xxo (x,y), with d XtCQ (x,y) = (xx (x) + Xx (y))^{x^ y } and consequently T c {v,^i) < 
T d (v, fi). But T d (v, fi) — Wxxo ■ {y — m)I|tv (see for instance, Prop. VI. 7 p. 154 of E]), which 
proves the second part of 13.2(1 . □ 
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Using the second part of inequality l).'S.2[) together with Theorem 12.71 one immediately derives the 
following result which is the first half of Theorem 1 1.141 : 

Proposition 3.3. Let c be a cost function on X of the form c(x, y) — q(d{x, y)), with q : M + — > K. + 
an increasing convex function and a S C satisfying Assumptions (A%) and (A2). Then the following 
T.C.I holds 

Wv€P(X), a < H(^ I n), (3.4) 



with a — V2m a inf ||q(2d(xo, -))||t_- Furthermore, if q satisfies the A2- condition 11.15}) with con- 
stant K > 0, then one can take a = \plKm a inf ||c(x_, . )||t q - 

x €X 

Remark 3.5. If q satisfies the A2- condition and dom a = M + 7 then /1 satisfies the following T.C.I : 

1 / loe f e Sa(c(xo,x)) du( x )\ 

Vv S V(X), %{v, /i) < V2Km a inf - 1 + - ^ a" 1 (H(i/ | //)) 

xo<£X,8>Q5 \ log2 I 

Now, let us prove the second half of Theorem 1 1.1 41 : 

Proposition 3.6. Let c be a cost function on X of the form c(x, y) = q(d(x, y)), with q : R + — > R + 
an increasing convex function satisfying the ^-condition 11.15}) with a constant K > and let a G C 
satisfy Assumption (Ai). If J x c(xo,x) dfi(x) < +00 for all xq G X and if the T.C.I \3.4\j holds for 
some a > 0, then the function c(xq, . ) belongs to L Ta (Af,/i) for all xq G X. 

Proof. According to the first part of inequality Q3.2|l . q{Td{y, fj)) < T c (v,y), thus, if (|3.4|1 holds 
for some a > 0, then a (Td(v, /J,)) < K(u | /i), for all v G V(X), where a(x) — ct(j&^. Ac- 
cording to Theorem 11.71 this implies that sup \\<P — (VjA*)!!^ < 3. In particular, using an 

tpZBUp^X ,d) 

easy approximation argument, it is easy to see that ||d(a;o, .) — (d(xo, .),//)|| Ta < 3, which implies 
that d(xo, .) G h T ~(X, fj). Let A > be such that J x T5 nME°>EL \ dfj,(x) < +00 and let n be a 
positive integer such that 2™ > A. Then, according to the A2 condition satisfied by q, one has 
q (f) > q (#r) > j^q(x), for all x G M+. Consequently, r a (f) > r a (f^r), for all x G E+. From 
this follows that 



C(X0 > X ^ dti*) =[rj <^%£^ Mx) <[ T J fefO , ^ < +oc 



and thus c(xo, . ) G L Tq (A", /i). □ 
Proof of Theorem \l.lJ\ Theorem II . 141 follows immediately from Propositions 13 . 31 and 13 .bl □ 
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